Novel robust decision support tool assisting early diagnosis of pathological voices using acoustic analysis of sustained vowels
Authors
Abstract
Effective vocal communication is critical in daily life, and 30% of the general population may suffer from a voice disorder at some point in their lives. Early diagnosis of voice pathologies facilitates mitigating symptoms and optimizing treatment for expedient recovery. Here, we studied the potential of an automated clinical decision support tool to differentiate subjects with early onset voice disorders from healthy controls simply on the basis of a single sustained vowel phonation. We characterized 200 phonations from 200 subjects with 445 speech signal processing algorithms, extracting clinically useful properties of the phonations in order to differentiate healthy and pathological cases. We selected parsimonious gender-dependent feature subsets and demonstrated that we can automatically differentiate healthy and pathological subject cohorts with approximately 91% overall accuracy. These compelling findings endorse the use of the proposed methodology towards assisting speech experts in vocal performance assessment and diagnosis of early onset voice disorders.
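The gender-dependent selection and classification step described in the abstract can be sketched as below. This is a minimal illustration, not the authors' pipeline: the abstract does not name the feature selection method or the classifier, so a simple effect-size filter and a nearest-centroid rule are swapped in as stand-ins. The synthetic data and all function names are hypothetical.

```python
import random
import statistics

def rank_features(rows, labels):
    """Rank feature indices by a simple effect-size score (stand-in filter)."""
    scores = []
    for j in range(len(rows[0])):
        healthy = [r[j] for r, y in zip(rows, labels) if y == 0]
        patho = [r[j] for r, y in zip(rows, labels) if y == 1]
        spread = statistics.pstdev(healthy + patho) or 1.0
        gap = abs(statistics.mean(healthy) - statistics.mean(patho))
        scores.append(gap / spread)
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)

def fit_centroids(rows, labels, selected):
    """Compute one centroid per class over the selected features only."""
    centroids = {}
    for cls in (0, 1):
        members = [r for r, y in zip(rows, labels) if y == cls]
        centroids[cls] = [statistics.mean(r[j] for r in members) for j in selected]
    return centroids

def predict(row, model):
    """Assign the class whose centroid is nearest (squared Euclidean distance)."""
    selected, centroids = model
    def dist(cls):
        return sum((row[j] - c) ** 2 for j, c in zip(selected, centroids[cls]))
    return min(centroids, key=dist)

def train_gender_models(data, k):
    """data maps gender -> (rows, labels); one small feature subset per gender."""
    models = {}
    for gender, (rows, labels) in data.items():
        selected = rank_features(rows, labels)[:k]
        models[gender] = (selected, fit_centroids(rows, labels, selected))
    return models

def overall_accuracy(data, models):
    """Pool predictions from the gender-specific models into one accuracy."""
    correct = total = 0
    for gender, (rows, labels) in data.items():
        for row, y in zip(rows, labels):
            correct += predict(row, models[gender]) == y
            total += 1
    return correct / total

def make_synthetic(informative, n=40, n_features=6, seed=0):
    """Toy data (hypothetical): only one feature separates the two classes."""
    rng = random.Random(seed)
    rows, labels = [], []
    for i in range(n):
        y = i % 2  # 0 = healthy, 1 = pathological
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        row[informative] += 5.0 * y
        rows.append(row)
        labels.append(y)
    return rows, labels

data = {
    "female": make_synthetic(informative=1, seed=1),
    "male": make_synthetic(informative=4, seed=2),
}
models = train_gender_models(data, k=1)
accuracy = overall_accuracy(data, models)  # resubstitution accuracy on toy data
```

Here the accuracy is measured on the training data for brevity. A real evaluation would use held-out subjects or cross-validation, and the feature matrix would hold the acoustic measures extracted from each phonation.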
Similar resources
Pathological Voice Analysis and Classification Based on Empirical Mode Decomposition
Empirical mode decomposition (EMD) is an algorithm for signal analysis recently introduced by Huang. It is a completely data-driven non-linear method for the decomposition of a signal into AM-FM components. In this paper, two new EMD-based methods for the analysis and classification of pathological voices are presented. They are applied to speech signals corresponding to real and simulated sustai...
Phonation stabilisation time as an indicator of voice disorder
There is increasing emphasis on use of connected speech for acoustic analysis of voice disorder, but the differential impact of disorder on initiation, maintenance and termination of phonation has received little attention. This study introduces a new measure of dynamic changes at onset of phonation during connected speech, phonation stabilisation time (PST), and compares this measure with conv...
Aging Female Voices: an Acoustic and Perceptive Analysis
This study examines the changes in adult female voices due to the aging process. Acoustic cues in voices that enable listeners to recognize a speaker’s vocal age are specified, as well as acoustic cues that directly indicate the speaker’s chronological age. The analysed data are recordings of the voices of 56 female speakers differing in age. The recorded speech samples include sustained vowel...
Automatic age detection in normal and pathological voice
Systems that automatically detect voice pathologies are usually trained with recordings from populations of all ages. However, such an approach might be inadequate because of the acoustic variations in the voice caused by the natural aging process. On top of that, elderly voices present some perturbations in quality similar to those related to voice disorders, which make the detection of pa...
Decision Support System for Age-Related Macular Degeneration Using Convolutional Neural Networks
Introduction: Age-related macular degeneration (AMD) is one of the major causes of visual loss among the elderly. It causes degeneration of cells in the macula. Early diagnosis can be helpful in preventing blindness. Drusen are the initial symptoms of AMD. Since drusen have a wide variety, locating them in screening images is difficult and time-consuming. An automated digital fundus photography...